Speech enhancement minimizing generalized euclidean distortion using supergaussian priors

نویسندگان

  • Amit Das
  • John H. L. Hansen
چکیده

We introduce short time spectral estimators which minimize the weighted Euclidean distortion (WED) between the clean and estimated speech spectral components when clean speech is degraded by additive noise. The traditional minimummean square error (MMSE) estimator does not take into account sufficient perceptual measure during enhancement of noisy speech. However, the new estimators discussed in this paper provide greater flexibility to improve speech quality. We explore the cases when clean speech spectral magnitude and discrete Fourier transform (DFT) coefficients are modeled by super-Gaussian priors like Chi and bilateral Gamma distributions respectively. We also present the joint maximum a posteriori (MAP) estimators of the Chi distributed spectral magnitude and uniform phase. Performance evaluations over two noise types and three SNR levels demonstrate improved results of the proposed estimators.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models

In this paper, we develop and evaluate speech enhancement algorithms, which are based on supergaussian generalized autoregressive conditional heteroscedasticity (GARCH) models in the short-time Fourier transform (STFT) domain. We consider three different statistical models, two fidelity criteria, and two approaches for the estimation of the variances of the STFT coefficients. The statistical mo...

متن کامل

Weighted Log-spectral Amplitude Estimation with Generalized Gamma Distribution under Speech Presence Probability

In this paper, we propose a speech enhancement approach. The approach is based on deriving weighted log-spectral amplitude estimator that exploits the generalized Gamma distributed speech priors under speech presence probability. The log-spectral amplitude estimator is weighted by psychoacoustically motivated speech distortion measure to take advantage of the perceptual interpretation. The expe...

متن کامل

Generalized Gamma Distributed Bayesian Estimator under Speech Presence Probability

Abstract: This paper presents an approach for speech enhancement based on the Bayesian estimator. The cost function in logarithmic domain of the Bayesian estimator is weighted by psychoacoustically motivated speech distortion measure. This weighted cost function exploits the generalized Gamma distributed speech priors under speech presence probability. The experimental results show that the pro...

متن کامل

Supergaussian Garch Models

In this paper, we introduce supergaussian generalized autoregressive conditional heteroscedasticity (GARCH) models for speech signals in the short-time Fourier transform (STFT) domain. We address the problem of speech enhancement, and show that estimating the variances of the STFT expansion coefficients based on GARCH models yields higher speech quality than by using the decision-directed metho...

متن کامل

A Robust Generalized Sidelobe Canceller Employing Speech Leakage Masking

A novel speech enhancement method based on generalized sidelobe canceller (GSC) structure is presented. We show that it is possible to reduce audible speech distortions and preserve residual noise level under acoustic model uncertainties. It can be done by constraining a speech leakage power according to masking phenomena and conditional minimizing the residual noise power. We implemented the p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009